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Abstract 

The population genetic structure of sea bass (Dicentrarchus labrax) along a 
transect from the Atlantic Ocean (AO) to the Eastern Mediterranean (EM) Sea 
differs from that of most other marine taxa in this area. Three populations 
(AO, Western Mediterranean [WM], EM) are recognized today, which were 
originally two allopatric populations. How two ancestral genetic units have 
evolved into three distinct units has not been addressed yet. Therefore, to inves- 
tigate mechanisms that lead to the emergence of the central WM population, 
its current status, and its connectivity with the two parental populations, we 
applied 20 nuclear loci that were either gene associated or gene independent. 
Results confirmed the existence of three distinct gene pools, with higher differ- 
entiation at two transitional areas, the Almeria-Oran Front (AOF) and of the 
Siculo-Tunisian Strait (STS), than within any population. Significant linkage 
disequilibrium and heterozygote excess indicated that the STS is probably 
another tension zone, as already described for the AOF. Neutrality tests fail to 
reveal marker loci that could be driven by selection within or among metapop- 
ulations, except for locus DLA0068. Collectively, results support that the central 
WM population arose by trapping two tensions zones at distinct geographic 
locations of limited connectivity. Population assignment further revealed that 
WM individuals were more introgressed than individuals from the other two 
metapopulations. This suggests that this population might result from hybrid 
swarming, and was or is still seeded by genes received through the filter of each 
tension zone. 



Introduction 

It is well established that, despite the apparent connectiv- 
ity of marine habitats and the high dispersal capabilities 
of marine organisms, populations may nevertheless be 
organized into well-defined genetic units (Hellberg et al. 
2002; Hauser and Carvalho 2008; Hellberg 2009; Sanford 
and Kelly 2011). If present-day processes can contribute 
to patterns of differentiation (Galindo et al. 2006; Gerlach 
et al. 2007; Selkoe et al. 2008, 2010), recognized phyloge- 
ographic barriers shared by a wide range of species 
represent the main barriers to gene flow in the marine 
environment (e.g., Benzie 1999; Barber et al. 2000; Patar- 



nello et al. 2007; Galarza et al. 2009; Kelly and Palumbi 
2010). 

Nevertheless, nearly all marine organisms have sporadic 
opportunities to cross almost all physical barriers and 
to exchange genes between phylogeographically differenti- 
ated populations. Opportunities for any neutral or 
unconditionally favorable mutation to invade most of - 
or the whole - the species range frequently occur after 
secondary contacts, so gene pools should be homogenized 
on either side because minimal gene migration easily 
overcomes mutation and drift over generations. Genetic 
differentiation persisting across phylogeographic barriers 
probably indicates nonneutral limitations to gene flow, as 
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exemplified by numerous marine hybrid zones worldwide 
(e.g., Gardner 1997; Strelkov et al. 2007; Hobbs et al. 
2009; Gagnaire et al. 2011). The classical "porous" (or 
"semipermeable") genome model states that differentia- 
tion would be wiped out from most genomic regions, 
except around loci conferring fitness advantage to one of 
the populations or, on the contrary, loci acquired during 
isolation and involved in endogenous incompatibilities of 
the Dobzhansky-Muller type (e.g., Barton and Hewitt 
1985; Mallet 2005; Payseur 2010; Bierne et al. 2011). Such 
postzygotic barriers contribute to natural selection against 
hybrid F 1; F 2 , and backcrosses (Nosil et al. 2005; Burton 
et al. 2006; Marshall et al. 2010), and may reveal costs of 
adaptation for immigrants (Hereford 2009). They, hence, 
contribute to the establishment of so-called tension zones 
reflecting the conflicting effects of dispersal of parental 
forms and selection against hybrids (e.g., Barton and 
Hewitt 1985). As the porous genome model implies wide 
heterogeneity in the distribution of differentiation along 
chromosomes, estimation of population divergence will 
depend on the loci investigated, their position relative to 
sites under selective pressure, and recombination rate 
(see, e.g., Bierne 2010; Payseur 2010). This leads to differ- 
ent introgression patterns among (classes of loci) across 
tension zones (e.g., Barton and Hewitt 1985). Conversely 
- but considerably less studied - new fit hybrid genotypes 
may arise from successfully recombinant individuals 
(Harrison 1993; Edmands et al. 2005; Hwang et al. 2011; 
Willett 2012). A new population (or taxon), designed as a 
hybrid swarm, can then emerge and successfully occupy a 
portion of the available habitat (Moore 1977; Arnold 
1993; Jiggins and Mallet 2000). 

The European sea bass Dicentrarchus labrax L. is a 
model for evolutionary studies among marine fishes, in 
both wild and cultured populations (Volckaert et al. 
2008; Chatain and Chavanne 2009). Over its distribution 
area, the species is structured into three main metapopu- 
lations (Atlantic, Western, and Eastern Mediterranean 
[EM]), separated by well-defined barriers to gene flow 
(Benharrat et al. 1983; Naciri et al. 1999; Bahri-Sfar et al. 
2000; Lemaire et al. 2005). Genetic homogeneity predom- 
inates in Atlantic and Western Mediterranean (WA) 
basins (e.g., Garcia de Leon et al. 1997; Naciri et al. 1999; 
Fritsch et al. 2007; Coscia and Mariani 2011), while the 
less well-studied EM basin is more structured (Bahri-Sfar 
et al. 2000; Castilho and Ciftci 2005). The genetic transi- 
tions over the range are each coincident with an oceanic 
front (the Almeria-Oran front [AOF] and the Siculo- 
Tunisian Strait [STS]). This may reflect tension zones that 
have been trapped there, with low dispersal rates and 
somewhat reduced population densities (Barton's trap- 
ping hypothesis; discussed in Bierne et al. 2011). Evidence 
that these transitional areas are tension zones in sea bass 



has however only been documented for the AOF by Lem- 
aire et al. (2005). Authors demonstrated differential intro- 
gression at mtDNA and nuclear markers, but also among 
anonymous nuclear DNA markers (microsatellites) at the 
AOF. Lemaire et al. (2005) also reported significant link- 
age disequilibrium (LD) at population samples located 
close to the AOF, which are classical features of tension 
zones (Barton and Hewitt 1985). 

All studies on sea bass have been based on a limited 
number of anonymous microsatellites of unknown 
genomic location. Therefore, the objectives of this study 
were to use distinct classes of nuclear markers that were 
either associated or independent of genes, in order to: (1) 
provide better estimates on the levels of differentiation at 
major transitional areas; (2) establish connectivity and 
introgression patterns for each group of markers or pat- 
terns shared by markers irrespective the group they 
belong to; and (3) improve knowledge on the origin of 
the central WM population of sea bass, in particular 
whether it is a hybrid swarm. 

Materials and Methods 

Samples and molecular analysis 

Sixteen natural populations (N = 474) were sampled 
among the three phylogeographic units of D. labrax dis- 
tribution area: Atlantic Ocean (AO), WM, and EM basins 
(Table 1). Most samples have been already analyzed in 
previous studies (Table 1). Nineteen microsatellite and 
one minisatellite markers were used, classified into two 
classes of markers: gene-associated loci (hereafter GAL) 
and gene-independent loci (GIL). A marker was classified 
as a GAL when it was situated either inside the span of 
an annotated gene or at <2 kilobases (kb) from one 
extremity of a coding sequence (CDS). The limit of 2 kb 
was retained because it generally encompasses the 
proximal promoter of most genes in vertebrates. Proximal 
promoters are known to be under functional constraints 
and, together with their closeness from CDS, mini- and 
microsatellite loci they contain may have greater probabil- 
ity to be influenced by selection (e.g., Li et al. 2004; 
Gemayel et al. 2010). A symmetrical size limit of 2 kb 
was also retained after the 3'-end of the CDS while - as 
far as we know - documented functional constraints are 
less reported on this side of genes. Those limits are, how- 
ever, operating limits to classify GAL and GIL; they do 
not mean that loci located farther from CDS cannot be 
impacted by selection (and vice versa). The GAL com- 
prises one mini- and three microsatellite markers that 
have been specifically designed for their association with 
growth hormone (GH: fiGH and tnGH), somatolactin 
(SL: jiSL), and insulin-like growth factor (IGF-1: jjJGF-1; 
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Table 1. Samples used in this study. 



Hybrid Swarm in Sea Bass 



Basin 


Abbreviation 


Population 


Country 


Number of individuals 


Geographic location 


Origin of sample 


AO (31) 


BB 


Bay of Biscay 


France 


31 


45°44'19"N 


1°25'1 1 "0 


Fritsch et al. (2007) 


WM (365) 


GOU 


La Goulette 


Tunisia 


30 


36°50'06"N 


10°19'59"E 


Guinand et al. (2008) 




ICH 


Ichkeul 


Tunisia 


49 


37°10'21 "N 


9°39'55"E 


Bahri-Sfar et al. (2000) 




MAR 


Marsala 


Italia 


24 


51 48 09 N 


1 1 lb I I b 


Naciri et al. (1999) 




ANN 


Annaba 


Algeria 


24 


36 54 47 N 


7°46'36"E 


Naciri et al. (1999) 




SBD 


Sabaudia 


Italia 


16 


41 1 9 36 N 


1 3 59 30 E 


Lemaire et al. (2000) 




FIUM 


Fiumicino 


Italia 


22 


41°51'36"N 


12°70'02"E 


Lemaire et al. (2000) 




OR 


Etang de I'Or 


France 


54 


43°34'35"N 


4°01'40"E 


This study 




PER 


Perols 


France 


30 


43°33'50"N 


3°58'10"E 


This study 




LUN 


Lunel 


France 


30 


43°36'05"N 


4°05'12"E 


This study 




SET 


Sete 


France 


38 


43°23'06"N 


3°42'19"E 


Guinand et al. (2008) 




MUR 


Murcia 


Spain 


48 


37°51'50"N 


0°43'31"O 


Lemaire et al. (2005) 


EM (98) 


SYR 


Syria 


Syria 


32 


35°34'50"N 


35°31'36"E 


This study 




CYP 


Cyprus 


Cyprus 


15 


34°33'03"N 


32°58'36"E 


Boutet et al. (2008) 




SYRA 


Syracuse 


Italia 


26 


37°47'07"N 


1 8°48'27"E 


This study 




SEL 


Selinunte 


Italia 


25 


36°50'53"N 


13°34'07"E 


This study 



AO, WM, and EM indicate the basin from where samples originate (AO: Atlantic Ocean; WM: Western Mediterranean; EM: Eastern Mediterra- 
nean). Numbers of individuals considered in each basin are given in brackets. References of published work that already used individual samples 
are reported. 



Quere et al. 2010; description and conditions herein) and 
seven microsatellite loci from two simple sequence repeat 
(SSR)-enriched genomic libraries (Tsigenopoulos et al. 
2003; Chistiakov et al. 2004) that appeared to be close to 
CDS afterward. The GIL class contains nine loci originat- 
ing from the same SSR-enriched genomic libraries than 
GAL (Table 2). 

Markers from SSR libraries were amplified in two mul- 
tiplexes PCR (Table 2). For the first multiplex, amplifica- 
tion conditions have been reported in Guinand et al. 
(2008). The second multiplex was optimized for a final 
volume of 10 fiL with 10 x Taq buffer, 30 mmol/L MgCl 2 , 
2.7 mmol/L dNTP, 10 ng of genomic DNA and 4 /miol/L 
of each primer. Reactions were performed on a PTC-200 
(MJ Research, St Bruno, Quebec, Canada) as follows: an 
initial denaturation at 95°C for 3 min, 35 cycles at 94°C 
for 45 s, annealing at the 58°C for 45 s, and 72°C for 
45 s, followed by a final elongation at 72°C for 10 min. 
Genotyping was performed on ABI PRISM 3130x1 or 
3700 DNA Analyser (Life Technologies, St Aubin, 
France), using 5'-labeled reverse primers and the Gene- 
Scan® TM-500 LIZ Size Standard (Life Technologies) as 
internal size standard. 

Data analyses 

After control for null alleles with MicroChecker (Van Oo- 
sterhout et al. 2004), all analyses - except neutrality tests 
- were first performed with individual samples (Fig. 1; 
Table 1), then with pooled samples corresponding to the 
three metapopulations. In order to provide descriptive 



summary statistics, the mean number of alleles (A), the 
mean unbiased expected gene diversity (H exp ; Nei 1987) 
were computed for each locus and each (meta)popula- 
tion. Significant within or among basins differences in A 
and H exp were tested using the nonparametric Mann- 
Whitney (/-test implemented in the Microsoft Excel 
add-on, StatEL. Complementarily, genetic description of 
sample was assessed by computing LD. LD between pairs 
of loci was computed using GENETIX v4.05 (http:// 
genetix.univ-montp2.fr/) in each sample, and tested using 
permutation tests (5000 permutations). Finally, deviations 
from Hardy-Weinberg expectations (HWE) within each 
original sample and within each metapopulation were 
investigated using / (Weir and Cockerham 1984), also 
using GENETIX. The null hypothesis (/= 0) of no signifi- 
cant departure from panmixia was tested by randomly 
permuting alleles (n = 1000) from the original matrix of 
genotypes. 

Among-sample comparisons were assessed by estimat- 
ing levels of population differentiation using 8, an estimator 
of F sx (Weir and Cockerham 1984). Estimates of 8 were 
independently computed for three data sets (i.e., the full 
set of 20 loci, then GAL and GIL sets). Their significance 
was tested according to the permutation procedure avail- 
able in GENETIX and corrected for multiple tests accord- 
ing to Benjamini and Yukitieli (2001) false discovery rate 
(FDR; Narum 2006). Distance trees were inferred from 
Reynolds et al. (1983) coancestry genetic distance matrix 
[D = 1 — ln(l — 8)] by the Neighbor procedure of the 
phylogenetic package Phylip 3.6 (Felsenstein 2005). Trees 
were visualized using Treeview (Page 1996). Following 
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Table 2. Microsatellite loci used in this study, including size ranges, GenBank accession numbers, fluorochrome labeling, and PCR primers. 











GenBank 


Size range 








Locus 


Category 


Annotation 


Motif 


Accession 


(bp) 




Fluorochrome 




Primers (5'=>3') 


DLA0041 


GIL 


- 


(TC) 8 (AC) 21 AA(AC) 3 AA(AC) 3 


DQ363864 


167 


-201 


FAM 


F 
Ft 


AAAAGGAACAGCCCTCCAC 
AGCATTGTTCTTCTGAGTGACC 


DLA0044 


GAL 


Unknown 


(TC) 18 TTTT(TC) 6 CTCC 


DQ363867 


105- 


-129 


HEX 


F 

R 


TCCGCTCCGCACCGAGTGAC 
ACCGCCCAAGGGTTGGACTG 


DLA0051 


GAL 


MAPK3 a 


(GT) 16 


DQ363874 


149- 


-181 


ROX 


F 
R 


AGGTTCTTGGCCTGGGAATC 
AGTGACAGCAGCCTCCAGAG 


DLA0060 


GAL 


Bestrophin 3 b 


(CA) 12 (TA) 3 AA(CA) 2 


DQ363883 


119- 


-131 


FAM 


F 

R 


GAGAGTTCATCCTGTTC GCTC 
TGTAGTAATAATG CGCTCTGCAA 


DLA0061 


GIL 


- 


(TG) 14 


DQ363884 


153- 


-167 


FAM 


F 

R 


AAAGGCCAGTGAAACTCATGT 
CTCCCTGTCCATCTGTCCTC 


DLA0066 


GIL* 


- 


(AG) 22 


DQ363887 


135 


-161 


PET 


F 

R 


GTTGACCGGAGTCCTAGC 
GGCCATATGTGTCTTGCTT 


DLA0068 


GIL 


- 


(CA) 7 CGCACG(CA) 3 


DQ363889 


247- 


-266 


NED 


F 

R 


CAACACCTGTTCCTCTGAACC 
GCATTAGCATTGATTGTCCTG 


DLA0070 


GAL 


Unknown 


(AC) 30 


DQ363891 


126- 


155 


VIC 


F 

R 


TCTGCTTGCATCTGTGGAAT 
GCCATCTGGCTAGCTTCACT 


DLA0073 


GIL* 


- 


(CT) 36 


DQ363894 


157- 


-181 


NED 


F 

R 


CATGACTTCATGTGCTAATGTCC 
AGTTCAGAGCGGCAACTGT 


DLA0075 


GIL 


- 


(CA) 15 


DQ363896 


180- 


-186 


PET 


F 

R 


CACATACACAAGCTTAACCC 
GGCAGAGATGGGAAATAGACA 


DLA0078 


GAL 


MAN2A1 C 


(AG) 29 


DQ363899 


223- 


-243 


VIC 


F 

R 


AAGACTGGACCTCTGGAGACC 
CACAAGGAACCGAGACAAGA 


DLA0081 


GAL* 


PPP2R5A d 


(CA), 6 


DQ363902 


202- 


-222 


PET 


F 

R 


GACGAAGACTTCAGAC GAGCTAT 
ATACCGAGCGACCATGTTG 


DLA0086 


GIL 




(AC) 26 


DQ363907 


191- 


-205 


FAM 


F 

R 


GCTAGAGGATTCATGTCGCTT 
ACCTGGTGATTGGCAATTCT 


DLA0089 


GAL 


LLGL1 6 


(GT) 15 


DQ363910 


127- 


-135 


NED 


F 

R 


ACGAGTAATGAGGACCCA 
GTCAAAACAGCCCACCTA 


DLA0096 


GIL* 




(TG) 16 


EF471091 


256- 


-270 


ROX 


F 

R 


AACTTAGTGAAGTAACTTGTGGCAA 
TCGATGCATCTAGGACAGGA 


DLA0097 


GIL 




(GT) 4 GC(GT) 2 GC(GT) 13 


EF471092 


216- 


-240 


HEX 


F 

R 


GCTGCAGGAGTGTGAGAGG 
GCGAGAGACTCGAGGAAGA 



Categories of loci to which markers belong to are reported, together with their functional annotation for GAL when available. Note that GAL 
developed by Quere et al. (2010; \iGH, mGH, \iSL, and jilGF-1) associated with growth hormone (GH), somatolactin (SL), and insulin-like growth 
factor 1 (IGF-1) are not reported in this table. Loci marked with an asterisk indicated loci that did not amplified in the EM samples, and were ana- 
lyzed only in the AO and WM samples. Alphabetical superscripts refer to selected references in which functional role of annotated GAL markers 
are reported. 

1 Size range established before this study. 
*Locus that was not genotyped in the EM basin. 
a Garcia et al. (1998); Kultz (1998). 
b O'Driscoll et al. (2008); Matchkov et al. (2008). 
c lshida et al. (1999). 
d Janssens and Goris (2001). 
e Wodarz (2000), Vasioukhin (2006). 

O'Reilly et al. (2004), we explored the relationship 
between 9 and H exp to check for effects of size homoplasy. 
We specifically controlled whether GIL and GAL markers 
were differently affected. We further explored the possible 
role of selection in shaping differentiation in the full, 
GAL, and GIL data sets using neutrality tests developed 
by Beaumont and Nichols (1996; hereafter B&N and 
implemented in LOSITAN Antao et al. 2008, http://pop- 
gen.eu/soft/lositan/), by Vitalis et al. (2001; implemented 



in DETSEL Vitalis et al. 2003, http://www.genetix. 
univ-montp2.fr/detsel.html), and by Foil and Gaggiotti 
(2008; hereafter: F&G and implemented in BayeScan, 
http://www-leca.ujf-grenoble.fr/logiciels.htm). All tests 
were first performed among populations within each 
basin, then among the three basins except for DETSEL, 
specifically designed for pairwise comparisons then 
between each pair of basins. Outlier loci detected using 
several methods would be stronger candidate to the 
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Figure 1. Map of sampling locations; samples are abbreviated as in 
Table 1. The colored symbols for samples belonging to each 
metapopulation will be used in other figures. Location of the Almeria- 
Oran Front and the Siculo-Tunisian Strait are reported. 

action of selection than loci detected once. The B&N and, 
when considered, the DETSEL tests were both performed 
according to P = 0.95 and P = 0.99 successively. F&G test 
considered a Log 10 (BF) of 1.3 and 2 (BF = Bayes Factor) 
corresponding to posterior probabilities of locus effects of 
0.95 and 0.99, respectively. 

The program MIGRATE-n v3.0 (Beerli 2008) was used 
to infer M, the migration rate, among basins only 
(M = ml p., where m is the immigration rate per genera- 
tion). Contrary to most F st -based statistics, MIGRATE-n 
provides estimation of asymmetric migrations among 
(meta)populations, indicating if a (meta) population is a 
net donor or net receiver of individuals over evolutionary 
times. It hence provides a picture how tension zones may 
function. These calculations used the Brownian mutation 
model and the mutation rate was considered equal for all 
loci (n = 10~ 4 ) even for GAL. We used coalescent maxi- 
mum likelihood (ML) based on Markov Chain Monte 
Carlo with Hastings Metropolis importance sampling to 
infer the various parameters (Beerli and Felsenstein 1999, 
2001). F ST estimates among basins were used as initial 
parameters for the estimation of 0 and M in MIGRATE- 
n. For each locus, the ML was run for 10 short and five 
long chains with 50,000- and 10,000-recorded genealogies, 
respectively, after discarding the first 1000 genealogies 
(burn-in) for each chain. One of every 20 reconstructed 
genealogies was sampled for both the short and long 
chains. We used an adaptive heating scheme with four 
concurrent chains. Analyses were all performed in tripli- 
cates either on the full set of loci or on the GAL and GIL 
sets independently. 

As a complement to estimates of population differenti- 
ation and asymmetric migration rates, we computed the 
probability of membership of individuals to each 
metapopulation (i.e., individuals should have higher 
probability of membership in the metapopulation they 
were sampled). Assignments of individuals to populations 



and associated probabilities were inferred with the soft- 
ware STRUCTURE v2.3 (Pritchard et al. 2000; http:// 
pritch.bsd.uchicago.edu/structure.html) by setting K = 3 
(i.e., three basins) after verification of the most likely 
number of independent population clusters was K = 3 
using Evanno et al.'s (2005) AK method (details not 
reported). The software used a Monte Carlo Markov 
Chain (MCMC) Bayesian clustering method that maxi- 
mizes the within-cluster Hardy-Weinberg and linkage 
equilibriums. The admixture model with noncorrelated 
allele frequencies was used for the full, GAL, and GIL 
data sets. A burn-in length of 50,000 iterations and subse- 
quent 500,000 additional MCMC iterations were carried 
out. Individuals were assigned to clusters based on the 
highest probability of membership (q-value). Five repli- 
cates were independently performed, giving reproducible 
results. Average q-values (±1 standard deviation) of indi- 
viduals to each metapopulation were hence computed 
from those individual assignments. 

Analyses using MIGRATE-n and STRUCTURE were 
first performed using initial sample sizes of the three 
basins (Table 1), but, as sample sizes were unbalanced, 
we also performed analyses with 31 individuals in each 
basin (i.e., the number of individuals from the AO basin). 
Results were highly reproducible and qualitatively compa- 
rable to those obtained with the full set of individuals 
and will not be reported. 

Results 

Numerous amplifications failures were observed for fishes 
from EM populations, at several loci (GAL: DLA0081, 
mGH; GIL: DLA0066, DLA0073, DLA0096). This was 
probably due to loss of sequence homology at PCR prim- 
ing sites. Those loci were discarded from some analyses, 
as relevant. Therefore, genetic comparisons were per- 
formed with 20 loci (11 GAL, nine GIL), except those 
involving EM populations, which were only based on 15 
loci (nine GAL, six GIL). 

Gene and allelic diversities 

Allelic and gene diversities are reported in Table 3. There 
were no differences among populations within the WM 
or the EM basin for the whole set of loci, or for the GAL 
and GIL marker sets considered separately (details not 
reported), (20 or 15 loci, respectively). A and H exp were 
not significantly different between the AO and WM meta- 
populations (P > 0.05 in each case; 20 loci). In contrast, 
A and H ex p of the EM metapopulation were significantly 
lower than the other metapopulations for the 15 loci 
(P < 0.001). When considering the GAL and GIL marker 
sets independently, A was significantly larger for GAL 
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Table 3. Gene and allelic diversity indices for each individual samples and metapopulations (AO, EM, WM) of sea bass considered in this study. 



Basin 


Sample 


Multilocus 






GAL 






GIL 








A, 


f 




A, 


f 


^exp 


A r 


f 


AO 


GG 


0.77 


8.70 


0.01 


0.78 


9.33 


-0.01 


0.76 


7.75 


0.03 


WM 


MUR 


0.79 


9.65 


0.01 


0.79 


10.17 


0.01 


0.78 


8.88 


0.01 




OR 


0.77 


9.20 


0.00 


0.76 


9.48 


0.04** 


0.79 


8.73 


0.06*** 




SET 


0.77 


7.84 


0.01 


0.77 


8.64 


-0.01 


0.78 


6.75 


0.03 




PER 


0.77 


8.61 


-0.01 


0.76 


9.17 


-0.01 


0.77 


7.75 


0.00 




LUN 


0.78 


8.90 


0.02 


0.78 


8.58 


0.00 


0.79 


9.38 


0.05 




FIUM 


0.75 


7.65 


0.00 


0.74 


8.25 


-0.04 


0.76 


6.70 


-0.02 




SBD 


0.77 


7.36 


-0.03 


0.74 


7.25 


-0.04 


0.81 


7.53 


-0.02 




MAR 


0.76 


8.13 


-0.01 


0.76 


8.06 


-0.02 


0.76 


8.26 


0.01 




ANN 


0.78 


9.47 


0.00 


0.79 


9.54 


-0.02 


0.77 


9.38 


0.03 




ICH 


0.74 


8.12 


0.02 


0.73 


7.83 


-0.01 


0.76 


8.48 


-0.07*** 




GOU 


0.77 


8.54 


-0.03** 


0.76 


8.83 


-0.03** 


0.77 


8.13 


-0.01 




WM metapop. 


0.77 


8.49 


0.01 


0.76 


8.71 


-0.01 


0.78 


8.18 


0.01 


EM 


SEL 


0.68 


5.24 


-0.07 


0.69 


6.10 


-0.08* 


0.66 


4.19 


-0.05 




SYRA 


0.60 


3.93 


—0 22*** 


0.61 


4.20 


—0.24*** 


0.56 


3.25 


-0.18 




CYP 


0.61 


4.13 


-0.06 


0.67 


4.48 


-0.06 


0.57 


3.62 


-0.07 




SYR 


0.66 


5.21 


-0.08 


0.67 


5.82 


-0.06 


0.62 


4.03 


-0.06 




EM metapop. 


0.64 


4.62 


-0.07*** 


0.67 


5.15 


-0.07*** 


0.61 


3.76 


-0.06 



AO, Atlantic Ocean; EM, Eastern Mediterranean; WM, Western Mediterranean; GAL, gene-associated loci; GIL, gene-independent loci. 
Multilocus estimates of fare also provided. *P < 0.05; "P < 0.01; ""P < 0.001. 



than for GIL in the WM metapopulation (Mann-Whitney 
U- test, P = 0.006). No such difference was observed 
between GAL and GIL in the AO and EM basins for H exp 
(P = 0.055 and P = 0.12, respectively). 

Hardy-Weinberg equilibrium and linkage 
disequilibrium 

After corrections for multiple tests, no significant depar- 
ture from HWE was detected for any individual samples 
belonging to the AO and WM basins, whatever data were 
considered (not reported). HWE was also respected after 
pooling the WM samples into a single metapopulation 
(all markers: / = 0.01, NS; GAL: / = -0.01, NS; GIL: 
/ = 0.01, NS; Table 3). For the EM populations, there 
was a significant heterozygote excess over the 15 loci in 
the SYRA (Syracuse) population (/ = -0.22; P < 0.001), 
while HWE was respected in other individual samples. 
This induced a significant heterozygote excess in the EM 
metapopulation (/ = —0.07; P < 0.001). It was mostly 
due to GAL markers in the SYRA and SEL (Selinunte) 
populations if = -0.24, P < 0.001; / = -0.08, P < 0.05, 
respectively). No significant departure from HWE was 
detected in the EM metapopulation for the multilocus 
GIL marker set (f = -0.06; NS; Table 3). 

Because larger sample sizes prevent detection of spuri- 
ous genotypic associations among loci, analysis of LD was 
primarily performed at the metapopulation level for Med- 
iterranean samples (i.e., AO consists of a single sample). 



After correction for multiple tests, there were only three 
significant cases of LD in the WM metapopulation. In the 
EM metapopulation, 12 significant cases of LD were 
found, that involved only four loci: two GIL and two 
GAL markers (DLA0068, DLA0086, DLA0070 and 
DLA0089). 

Relationships between 0 and gene diversity 

Relationship between 6 and H exp within WM and EM 
revealed no significant negative correlations (WM| 20 i oc i}'- 
R 2 = 0.062, NS; EM [15 i oci] : R 2 = 0.129, NS; Fig. SI). 
There were no significant correlations by pairwise meta- 
population comparison, AO-WM, AO-EM and WM-EM, 
nor comparison among the three metapopulations (AO- 
WM |2C i oci] : R 2 = 0.158, NS; AO-EM |15 loci| : R 2 = 0.052, 
NS; WM-EM, 15 lod] : R 2 = 0.043, NS; three metapopula- 
tionS[i5 i oc j]i R 2 = 0.028, NS). When present, homoplasy 
did not significantly affect data and these results legiti- 
mize further interpretation in terms of differential gene 
flow and/or selection. 

Genetic differentiation within the WM and 
EM metapopulations 

Within the WM basin, no significant differentiation was 
detected (6 = 0.004, NS). There was no differentiation for 
the GIL or the GAL data separately (0 = 0.001, NS; 
6 = 0.004, NS, respectively). Within the EM basin, there 
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was significant differentiation only for the GAL data 
(9 = 0.049, P < 0.01), but not for the full or GIL data 
(9 = 0.021, NS; 9 = -0.006, NS, respectively). 

Genetic differentiation among basins 

Reynolds' distance trees based on the 15 locus data clearly 
partitioned individual samples into three basins (Fig. 2). 
Levels of differentiation were generally significant and 
dependent on the pairwise basin comparison and markers 
used (GAL or GIL; Fig. 2). The differentiation between 
AO and WM was greater than between WM and EM, 
whatever data considered (Fig. 2). 

The individual values of 9 for each locus in the AO- 
WM and the WM-EM comparisons are reported in 
Figure 3A,B, respectively. Between AO and WM (20 loci 
used), five loci had 9 > 10%, including locus mGH where 
9 f=s 25%. The multilocus differentiation estimator 9 was 
0.056 (P < 0.001) with significant differentiation at nine 
markers (GAL: six markers; GIL: three markers, Fig. 3A). 
The GAL data had higher values of 9 than the GIL data 
(9 = 0.073 vs. 9 = 0.034; both P < 0.001; Fig. 2). Using 
15 loci, levels of genetic differentiation were significant 
between WM and EM (9 = 0.026; P < 0.001), with signif- 
icant individual estimates of 9 at nine of the 15 loci 
(GAL: four markers; GIL: five markers; Fig. 3B). Contrary 
to the AO-WM comparison, differentiation at the GIL 
markers was greater than at the GAL markers in the WM 



-EM comparison (respectively, 
both P < 0.001; Fig. 2). 



0.036 and 9 = 0.020, 



Patterns of genetic differentiation 

There was wide variability in spatial patterns of allelic 
frequency changes (Fig. 4), including cases where allele 
frequencies were (1) almost identical in all metapopula- 
tions, reflecting either high gene flow, shared ancestral 
polymorphisms, or general selective sweeps at these loci 
(e.g., DLA0089, /(GH, resulting in nonsignificant 9 values 
in Fig. 3B), (2) identical in two adjacent metapopulations 
(AO-WM or WM-EM) then marking a sharp break at 
one of the transition zones (e.g., DLA0051, DLA0097), or 
(3) in an apparent east-west gradient of allele frequencies 
(e.g., DLA0068). Those latter two classes of loci naturally 
coincided with loci achieving higher levels of differentia- 
tion (Fig. 3). Locus DLA0060 appeared as a particular 
case with almost identical allele frequencies for the meta- 
populations located at each side of the distribution but 
drastic change in frequency for the most frequent alleles 
in the central metapopulation. 

Population assignment and migration 

Analyses regarding assignment were initially performed 
for the 15 markers in all metapopulations, then only 
using GAL (n = 9; DLA0044, DLA0051, DLA0060, 



All markers 




Figure 2. Reynold's distance trees for each marker set used in this study (all markers [15 loci]: GAL [9 loci] and GIL [6 loci]). Figures located at 
the bottom of each tree summarized patterns of genetic differentiation among samples within metapopulation (no estimate for Atlantic Ocean 
[AO] as one unique sample was considered in this metapopulation; Table 1), and among metapopulations as estimated by 9 for each marker set. 
In the case of the AO-Western Mediterranean, 8 values are also reported in brackets for the 20 loci marker set, then for the set of 1 1 and 9 GAL 
and GIL marker sets, respectively. Samples are abbreviated as in Table 1 and colored symbols as in Fig. 1 . **P < 0.01, ***P < 0.001 . 
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(b) 



3. 



II I 



a 



S 



Figure 3. Estimates of 8 observed at each individual locus for (a) the Atlantic Ocean-Western Mediterranean (WM) comparison (20 loci), and (b) 
the WM-Eastern Mediterranean (EM) comparison (15 loci). Loci are arranged according to increased values Of 9 in each comparison. Horizontal 
lines indicate mean values of 9 observed in each comparison (shaded area = 95% CI). Vertical lines indicate which individual loci were found 
significant (on the right) compared to those that were not (on the left) for each comparison. Open and black boxes indicate GIL and GAL, 
respectively. Asterisks indicate loci absent in the WM-EM comparison. 



DLA0070, DLA0078, DLA0089, piGH, filGFl, ^iSL) or GIL 
(n = 6; DLA0041, DLA0061, DLA0068, DLA0075, 
DLA0086, DLA0097; Fig. 5). Unfortunately, some loci 
with a dramatically high 9 between AO and WM could 
not be considered because they failed to amplify in EM 
(e.g., mGH; Fig. 3A). This lowered the probability of 
assignment for AO and WM individuals. Nevertheless, 
better mean average assignment to their population of 
origin was achieved for AO and EM individuals (mean q- 
value = 0.548 ± 0.156 and 0.868 ± 0.151, respectively) 
than for the WM individuals (q = 0.326 ± 0.206; WM 
individuals were in average better assigned to the EM 
cluster: q = 0.487 ± 0.291 when using all loci). This sug- 
gested higher introgression in the WM compared with 
the other basins (Fig. 5), and correlated well with lower 
all-marker genetic differentiation estimated at the STS 
compared to the AOF (Fig. 2). When analyzing the GAL, 
only EM individuals were better assigned on average to 
their actual population of origin (q = 0.596 ± 0.223), 
while correct assignment was far lower for other meta- 
populations (AO: q = 0.417 ± 0.117; WM: q = 0.370 
± 0.134). When the GIL was used and whatever the 



metapopulation considered, probabilities of correct 
assignment did not differ significantly from q = 0.33 
(details not reported). It has to be emphasized that results 
for GIL in this analysis did not indicate lower intrinsic 
relevance of GIL to assign individual, nor increased gene 
flow for this class of loci, but were primarily due to the 
low number of GIL in this particular analysis (n = 6) 
with only one locus demonstrating genetic differentiation 
among each pair of basin (DLA0068; Fig. 3). At the scale 
of the three metapopulations, it resulted that those six 
loci collectively had significant but low average 9 = 0.031 
(P < 0.001), resulting in poor ability to successful indi- 
vidual assignment. Using simulated data sets in a five 
deme system, Latch et al. (2006) reported an average 
value of 9 = 0.030 as a lower bound to consistently assign 
individuals to a subpopulation (i.e., with a probability 
distinct from equiprobability). Empirical 9 observed in 
sea bass is very close from this lower bound, easily 
explaining poor assignment capacities with this data set. 

Individual WM samples located close to the STS (e.g., 
GOU, MAR; Fig. 1) present a compound allele pattern 
slightly closer to that of individuals from the EM samples 
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Figure 4. Variation of allele frequencies observed at each individual locus among sea bass metapopulations. For each marker, colored lines 
represent variation for a single allele. Because of size of the panels associated to loci, all alleles might not be visible at each locus. 
Metapopulations are abbreviated as in Table 1 . 
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All markers 

(15 loci) 



GAL 



GIL 




AO WM EM 



Figure 5. Summary of results obtained by MIGRATE-n regarding the estimation of number of migrants among sea bass metapopulations for 
each set of markers considered in this study (15 loci data set; left panel), and STRUCTURE regarding probability of membership of each individual 
to one of the K = 3 sea bass metapopulations considered in this study. Thickness of arrows is broadly proportional to estimated number of 
migrants. Samples and metapopulations are abbreviated as in Table 1. 



than to WM samples located further west (e.g., MUR) for 
all assignments, either performed over the full data set or 
with the random subsets of 31 individuals. Despite this, 
WM individuals from western and eastern WM samples 
were not assigned significantly differently from EM sam- 
ples over five replicates, whatever the number and class of 
markers considered (details not reported). 

Maximum-likelihood estimates of migrations using 
MIGRATE-n revealed asymmetric exchange from AO to 
WM with a approximately threefold to fivefold difference 
in number of successful migrants when using GAL and all 
markers (GAL: M ao ^wm = 43.03 vs. M WM ^ AO = 12.32; 
all markers: M ao ^wm = 34.47 vs. M WM ^ AO = 6.22; 
Fig. 5). This asymmetry of migration between AO and 
WM was inferior to two for the GIL data set (Fig. 5). 
Between WM and EM, the asymmetry was also estimated 
with a higher number of alleles from EM migrating toward 
WM. The extent of the asymmetrical exchange was how- 
ever lower when considering all markers (M EM _> 
WM = 15.64 vs. M WM ^ EM = 9.62), or the GIL and GAL 
data sets separately (Fig. 5). Results of MIGRATE-n there- 
fore indicate greater introgression in WM than in other 
metapopulations. That is, WM received more alleles by 
migration than they provided to the other two metapopu- 
lations, suggesting it may represent a hybrid swarm 
separated by two tension zones at the AOF and STS where 



migration of genes are asymmetrically filtered. It was how- 
ever expected that lower estimates of genetic differentiation 
should translate in higher estimated number of migrants at 
the STS. This point has to be investigated further. 

Neutrality tests 

Neutrality tests were initially performed on individual 
samples within the WM and EM metapopulations, 
respectively. DETSEL was not used at this scale because 
of too many, not fully independent pairwise comparisons. 
In WM, no congruence between the B&N or the F&G 
tests was detected among potentially selected loci when 
using P = 0.95 (or Log 10 (BF) = 1.3). No locus was signif- 
icant for a more stringent probability or BF (not 
reported). Indeed, B&N indicated that three loci were 
outliers and hence could potentially be under selection 
(fiSL, DLA0066 and DLA0070) in WM, while F&G 
detected eight different loci (DLA0044, DLA0068, 
DLA0081, DLA0089, DLA0096, fiIGF-1, fiGH, and mGH). 
In EM, F&G did not detected potentially selected loci 
whatever the Log 10 (BF) considered, while B&N detected 
three loci at P = 0.95 (fiSL, fiGH, and fiIGF-1; none at 
P = 0.99). It must be noted that those loci demonstrated 
the highest levels of differentiation among EM samples 
(details not shown). As results of neutrality tests failed to 
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converge, the reality of nonneutrality within metapopula- 
tions is open to question. 

Results were slightly more conclusive when performing 
neutrality tests among metapopulations. In the AO-WM 
comparison, the three tests did not provide similar 
results, but all indicated locus DLA0068 (GIL) as a possi- 
ble outlier (Table 4). For the WM-EM comparison, B&N 
and DetSel detected the DLA0060 (GAL), DLA0068, and 
DLA0097 (GIL) loci as outliers for a minimal probability 
level of P = 0.95. F&G also detected three outlier loci for 
Log 10 (BF) > 1.3: DLA0068, ^tlGF-l and fiSL (Table 4). 
Locus DLA0068 was detected by all three tests and was 
the locus with the sharpest pattern of graded variation 
across metapopulations (Fig. 4). 

Discussion 

Genetic differentiation between Atlantic and Mediterra- 
nean populations of marine organisms is usually inter- 
preted as the signature of secondary contact between two 
formerly allopatric lineages, occurring since the last glacial 
maximum (Borsa et al. 1997; Patarnello et al. 2007; but 
see, e.g., Grant 2005; Limborg et al. 2012). Accordingly, 
support for two distinct mtDNA clades, or nuclear differ- 
entiation, between Atlantic and Mediterranean popula- 
tions at the AOF is numerous in marine organisms 
(Patarnello et al. 2007). When differentiation at the 

Table 4. Summary of loci found as being potentially selected at the 



metapopulation level in each outlier 


detection 


selective test 


used in 


this study (B&N: Beaumont and Nichols 1996; 


DetSel: Vitalis et al. 


2001; F&G: Foil and Gaggiotti 2008). 










Neutrality Test 




Locus 


B&N 


DetSel 


F&G 


A: AO vs. WM DLA0051 


0.993 


>0.99 




DLA0060 




>0.95 




DLA0061 




>0.95 


0.992 


DLA0068 


0.973 


>0.99 


1.0 


DLA0070 


0.995 


>0.95 




DLA0075 


0.999 




0.976 


DLA0096 


0.958 


>0.99 




DLA0097 


0.999 




0.952 


mGH 


0.978 


>0.99 




f ilGF1 


0.997 




1.0 




0.967 




0.983 


B: WM vs. EM DLA0060 


0.993 


>0.95 




DLA0068 


0.964 


>0.99 


0.982 


DLA0097 


0.996 


>0.99 




f ilGF1 






1.0 


t iSL 






0.997 



AO, Atlantic Ocean; EM, Eastern Mediterranean; WM, Western Medi- 
terranean. 

Locus DLA0068 found as an outlier in each test is indicated in bold. 



AOF is not demonstrated (e.g., Bargelloni et al. 2003), 
differentiation is generally detected in another area of the 
Mediterranean, including the STS and some other loca- 
tions (e.g., south of Peloponnesus; Patarnello et al. 2007). 
The originality of sea bass resides in their partition into 
three metapopulations, separated by two areas of reduced 
genetic connectivity: the AOF and the STS (Naciri et al. 
1999; Bahri-Sfar et al. 2000; Lemaire et al. 2005). To our 
knowledge, reports of two locations of reduced connectiv- 
ity within the Mediterranean are pretty scarce, apart from 
the Black Sea (but see Rolland et al. 2007; Zulliger et al. 
2009; Limborg et al. 2012). In sea bass, the hypothesis of 
a purely physical barrier shaping variation at the AO- 
WM transition has been seriously reassessed by Lemaire 
et al. (2005), who demonstrated a tension zone at this 
location. By using nuclear markers (GAL and GIL), we 
expanded on the study by Lemaire et al. (2005) to dissect 
the sea bass' patterns of genetic variation further. 

Patterns of genetic differentiation and 
support for a tension zone at the Siculo- 
Tunisian strait 

This study did not change the perception of sea bass 
genetic structure within each basin at nuclear loci (i.e., no 
differentiation inside the WM basin but some differentia- 
tion within the EM basin; Garcia de Leon et al. 1997; 
Naciri et al. 1999; Bahri-Sfar et al. 2000; Castilho and 
Ciftci 2005), nor the existence of three metapopulations. 
Nevertheless, estimates of differentiation among successive 
pairs of basins were larger than previously estimated with 
fewer microsatellite loci. The differentiation between the 
AO and WM basin reported in this study (0[ 2u 
loci] = 5.6%) was more than twice that provided by Naciri 
et al. (1999) over six loci (9 « 2.1%). Estimates of differ- 
entiation at the WM-EM transition zone were also higher 
($[i5 loci] ~ 2.6%) than those reported by Bahri-Sfar et al. 
(2000) (6 « 1.4%; six loci). Results hence confirmed a 
stronger nuclear genetic barrier at the AOF than at the 
STS. This pattern was also demonstrated for mtDNA in 
sea bass (Lemaire et al. 2005; R. Rondon, E. Desmarais, 
F. Bonhomme and B. Guinand, unpubl. results), but also 
in sprat (Limborg et al. 2012). Using far less nuclear 
markers, Rolland et al. (2007) suggested on the contrary 
that differentiation should be stronger at the STS com- 
pared with the AOF for the common sole (Solea soled). 

GAL and GIL markers may both be involved in overall 
differentiation and/or differentiation among adjacent 
metapopulations, but the extent of differentiation 
depended on the particular comparison. Indeed, GAL that 
are located closer from or even within genes consistently 
demonstrated in average reduced genetic connectivity 
compared GIL at the AO-WM transition (20 loci; 
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6gal = 7.3%; 8 G i L = 3.4%). However, a reverse relation- 
ship was found at the WM-EM transition (15 loci; 
$gal = 2.0%; 6 G1L = 3.6%). This second observation 
hence partly conflicts with those reported in other studies 
in which gene-associated markers had, on average, better 
discriminatory power than independent markers (e.g., 
Vasemagi et al. 2005; Yatabe et al. 2007; Shikano et al. 
2010; Vilas et al. 2010). 

Besides differentiation, it is also interesting to note that 
significant LD and heterozygote excess was concentrated 
in a few samples. Heterozygote excess was especially 
marked for GAL at the WM-EM transition on both sides 
of Sicily: for the SEL and GOU samples, and at the Mes- 
sina canal for the SYRA sample. LD only occurred in the 
SEL and SYRA samples from the EM basin. This observa- 
tion is unlikely to be due to low sizes of SYRA and SEL 
samples, as others with similar sizes did not demonstrate 
any heterozygote excess and very few cases of LD. The 
GOU, SEL, and SYRA samples are all located at the WM- 
EM transition, suggesting that heterozygote excess and/or 
LD may be the signature of hybridization among parental 
pools at the STS, as already demonstrated by Lemaire 
et al. (2005) at the AOF. There, significant LD was only 
found in the sample closest to the AOF. In this study, we 
report a nearly identical situation at the STS (but note 
that the MAR sample also located close to the front did 
not show significant LD and/or heterozygote excess, in 
line with previous results by Naciri et al. (1999) and 
Bahri-Sfar et al. (2000) who also used this sample). 
Results are in agreement with the definition of a tension 
zone rather than with strict isolation by distance or 
parapatric differentiation. By theoretical modeling, Irwin 
(2012) has shown that genetically differentiated haploid 
clades arising from local adaptation along an environ- 
mental gradient can arise in a parapatric fashion under 
certain conditions and persist in a metapopulation even 
in the presence of moderate gene flow. Nevertheless, 
extension to multilocus diploid cases in high gene flow 
species has not been done, so it seems reasonable to stick 
to the less costly hypothesis of historical differentiation 
followed by secondary contact. If there was a single rea- 
son for this more parsimonious explanation, this would 
be at least because paleobiogeography indicates that favor- 
able ecological zones have indeed been fragmented several 
time during the glacial maximums, notwithstanding other 
theoretical considerations about all the conditions 
required for the progressive building up of multilocus dif- 
ferentiation in presence of gene flow in highly dispersing 
species. We hence can reasonably be confident that the 
encounter of differentiated genetic backgrounds in the sea 
bass is a likely possibility as demonstrated for other 
marine organisms with large dispersal capabilities (e.g., 
bivalve mollusks, Nikula et al. 2008). 



Spatial patterns support tension zones, but 
rarely selection at target loci 

In addition to LD and/or heterozygote excess, spatial 
patterns of allele frequency changes are in accordance 
with the prediction of a tension zone at both the AO- 
WM and WM-EM transitions. The theory predicts that 
allele frequency changes are more or less pronounced 
depending on the marker investigated, with steeper transi- 
tions for loci closer to "isolation genes" (Barton and 
Hewitt 1985). In this study, some markers present no 
change in allele frequency, reflecting either one uncon- 
strained high neutral gene flow, a selective sweep of one 
uniformly advantageous allele or the inheritance of an 
ancestral state where allele frequencies were already 
homogeneous. On the other hand, some loci show rela- 
tively sharp changes at AO-WM transition as already 
shown for mtDNA (Lemaire et al. 2005), or at the 
WM-EM transition, which has been less extensively inves- 
tigated (Bahri-Sfar et al. 2000). A vicariant event that 
produced at least two separate Atlantic and Mediterranean 
metapopulations of sea bass during one of the quaternary 
climatic oscillations will have led to the emergence of isola- 
tion genes. These endogenous barriers influenced many 
other loci by indirect (hitchhiking) selection at genomic 
scales, depending on their strength and the amount of 
recombination (Barton and Hewitt 1985; Pialek and 
Barton 1997; Slatkin and Wiehe 1998). According to gen- 
eral hybrid zone models, we postulate that barriers have 
been trapped at various locations which correspond to 
areas of reduced density and/or environmental change, 
such as the AOF and STS. A rationale of this phenomenon 
is provided in Bierne et al. (2011), including a review of 
marine examples (see also Luttikhuizen et al. 2012). 

Locus DLA0060 (GAL; located in intron one of the 
bestrophin three gene) presents a distinct pattern of spatial 
allele frequency compared to other loci, with the two most 
frequent alleles at higher frequencies in WM than in the 
marginal AO and EM metapopulations In the present case, 
we cannot distinguish whether locus DLA0060 was linked to 
an "endogenous" locus whose variation in allele frequency 
reflects genomic incompatibilities, or to an "exogenous" 
locus whose variation reflects local adaptation, both fitting 
with the coupling hypothesis of Bierne et al. (2011). 

In hitchhiking models of genetic variation, direct selec- 
tion on markers is not needed (Pialek and Barton 1997; 
Bierne et al. 2003). In sea bass, neutrality tests were 
unable to identify loci with clear footprint of selection, 
except DLA0068 (GIL). This locus presents a steeper gra- 
dient of allele frequencies than other loci, with a nearly 
fixed allele in the AO sample and an alternative one that 
predominates in the EM sample. This marker was not 
clearly associated with a gene. It is situated at approximately 
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4 kb from the 3' end of a Retinal G Protein Coupled 
Receptor gene (RGR) on one side and approximately 
6 kb from the 5' end of the CDS of a solute carrier gene 
(SLC18A3 also known as VAChT, e.g., Weihe et al. 1996; 
Karila et al. 1998) on the other. We cannot therefore 
reach firm conclusions about its functional importance. 

A hybrid swarm in the Western 
Mediterranean? 

The WM contains strongly introgressed individuals, but 
also receives migrants from other basins. Taken together, 
this indicates that the WM metapopulation may have a 
dual origin from both AO and EM nuclear genomes, 
which have introgressed into each other and led to the 
emergence of a new reproductive unit, known as a 
hybrid swarm (Arnold 1993; Jiggins and Mallet 2000). 
The absence of LD among loci in each WM sample, 
except the GOU sample located close to the STS, also 
supports this hypothesis of a sea bass hybrid swarm. 
Interestingly, Schunter et al. (2011) recently reported a 
similar result in the comber Serranus cabrilla within the 
WM basin. In their case, combers from locations located 
between the two recognized fronts for this species (AOF 
and the Ibiza Channel) were significantly more intro- 
gressed than individuals from populations on the exter- 
nal sides of each front, although the presence of tension 
zones was not investigated. Our results on sea bass are 
also comparable to experimental studies by Edmands 
et al. (2005) on the copepod Tigriopus californicus. After 
approximately 15 generations of mixing, Edmands et al. 
(2005) found more hybrids than expected, together with 
heterozygote excess. These experimental results conform 
to the situation observed in wild sea bass near tension 
zones, as heterozygote excess and LD were recorded at 
the STS and AOF (this study and Lemaire et al. 2005, 
respectively). Overall, the results suggest that the WM 
metapopulation of sea bass could result from hybrid 
swarming, that is separated from parental forms by two 
distinct tension zones. 

Hybrid swarms are poorly documented in marine 
organisms (McMillan et al. 1999; Roques et al. 2001; Rob- 
erts et al. 2010) by comparison with terrestrial or freshwa- 
ter taxa (e.g., Arnold 1997; Nolte et al. 2009; Stemshorn 
et al. 2011). They may be restricted to narrow marine 
zones (estuaries; see Roberts et al. 2010) or extend over 
larger areas (Ayres et al. 2004; Strelkov et al. 2007; Nikula 
et al. 2008). Strelkov et al. (2007) analyzed patterns of 
genetic variation in a hybrid swarm of M. balthica that 
extends over hundreds of kilometers in the Norwegian, 
Barents, and Pechora seas. The samples within this swarm 
also showed a component of regional differentiation that 
was not observed in sea bass: within WM, no differentia- 



tion was detected and there was no significant indication 
of differential introgression with AO and EM individuals 
for samples located to the west or the east. 

The random coupling of exo- and endogenous incom- 
patibilities hypothesized by Bierne et al. (2011) can elicit 
emergence of distinct tension zones between successive 
fragmented patches of habitat. Using a two-locus step- 
ping-stone model of gene flow with overdominance, 
Goodisman and Crozier (2001) also demonstrated that 
two spatially distinct tension zones can emerge between 
formerly isolated populations, giving rise to an additional 
central population. Goodisman and Crozier (2001) also 
demonstrated that LD accumulated at the two tension 
zones separating the three successive populations, while it 
disappeared in the central population, as observed in sea 
bass in the WM basin. 

Although sea bass population structure and pattern of 
population differentiation conform to the expectations of 
some models, it is essential to acquire further data to 
investigate the hypothesis of a hybrid swarm in the WM. 
Such caution particularly derives from statistical consider- 
ations. This includes the discrepancy between estimated 
levels of differentiation and estimated number of migrants 
at the STS. It was expected that lower genetic differentia- 
tion should translate in higher number of estimated 
migrants, but this trend was not observed possibly casting 
doubt on this particular result. Furthermore, numerous 
methods for analysis of hybridization processes have 
emerged over recent years. They range from identification 
of F : to F x hybrids and backcrosses (e.g., Anderson and 
Thompson 2002), to evaluating the fitness effects of alleles 
transferred across different genetic backgrounds (i.e., 
introgression; Gompert and Buerkle 2009), to genetic dif- 
fusion models (Stemshorn et al. 2011). Methods aim to 
analyze introgression (admixture) patterns together with 
the dynamics and history of introgression (Nolte and 
Tautz 2010; Lamaze et al. 2012). To derive consistent out- 
puts, however, they require clear recognition of parental 
genotypes (Nolte et al. 2006, 2009; Gagnaire et al. 2011; 
Luttikhuizen et al. 2012). This is typically not observed in 
sea bass, where parental genotypes (AO, but also EM 
metapopulations) are both significantly introgressed, 
hence limiting the capacity for reliable inference. High 
throughput genome-wide genotyping will probably 
provide the statistical power to disentangle recent from 
old and shared from unshared polymorphism across tran- 
sition zones in sea bass. 
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Supporting Information 

Additional Supporting Information may be found in the 
online version of this article at the publisher's website: 

Figure SI. Single-locus estimates of F st as estimated by 9 
(mean ± 95% CI) vs. expected heterozygosity for (a) the 
20 loci used for the Western Mediterranean samples, and 



(b) the 15 loci used for the Eastern Mediterranean sam- 
ples. Relationship between 9 and H exp revealed no signifi- 
cant negative correlations (Western Mediterranean: 
R 2 = 0.062, NS; Eastern Mediterranean: R 2 = 0.129, NS). 
Open and filled diamonds indicate GIL and GAL, respec- 
tively. Labels of individual loci are not reported. 
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